SEQ ID NO: 12 0 



ycfE protein - Escherichia coli (strain K-12) cryptic prophage el4 
C; Species: Escherichia coli 

C;Date: 12-Sep-1997 #sequence_revision 17-Sep-1997 #text_change 01-Mar- 
2002 

C;Accession: B64861; S07185 

R;Blattner, F.R.; Plunkett III, G.; Bloch, C . A . ; Perna, N.T.; Burland, 
V.; Riley, M . ; Collado-Vides , J.; Glasner, J.D.; Rode, C.K.; Mayhew, 
G.F.; Gregor, J.; Davis, N.W.; Kirkpatrick, H . A . ; Goeden, M.A.; Rose, 
D.J. ; Mau, B. ; Shao, Y. 
Science 277, 1453-1462, 1997 

A;Title: The complete genome sequence of Escherichia coli K-12. 
A;Reference number: A64720; MUID : 97426617 ; PMID:9278503 
A; Accession : B64861 

A; Status : nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A;Residues: 1-179 <BLAT> 

A; Cross-references : GB : AE0 0 02 14 ; GB : U0 0 0 96 ; NID:gl7 873 82 ; 

PIDN:AAC74241.1; PID : gl7 8 74 0 3 ; UWGP:bll57 

A; Experimental source: strain K-12, substrain MG1655 

R;Plasterk, R.H.A.; van de Putte, P. 

EMBO J. 4, 237-242, 1985 

A;Title: The invertible P-DNA segment in the chromosome of Escherichia 
coli . 

A;Reference number: S07185; MUID : 85257443 ; PMID:3894006 

A; Accession : S07185 

A; Status : translation not shown 

A; Molecule type: DNA 

A; Residues: 1 IKSDGTVQTALE ' ,13, »L' ,15-164, ' AVRVILHLTIS ' <PLA> 

A; Cross-references : EMBL : X01805 

C; Genetics : 

A; Gene: ycfE 

A;Map position: 25 min 

A;Genome: cryptic prophage el4 

C; Superf amily : phage PI major tail fiber protein S 

Query Match 1.6%; Score 8; DB 2; 

Best Local Similarity 100.0%; Pred . No. 4.8; 
Matches 8; Conservative 0; Mismatches 
Gaps 0; 



Length 179; 

0; Indels 0; 



Qy 2 0 GLGEGSAL 2 7 

MINIM 

Db 15 GLGEGSAL 2 2 



ID AAB43919 standard; Protein; 602 AA . ~\ , , _ ^ ; ") 

XX i ' 

AC AAB4 3 919; 
XX 

DT 08-FEB-2001 (first entry) 
XX 

DE Human cancer associated protein sequence SEQ ID NO: 1364. 
XX 

KW Human; cancer associated gene; cancer antigen; detection; cancer; 

KW diagnosis; cytostatic; proliferative; vulnerary; immunomodulator; 

KW antidiabetic; antiasthmatic; antirheumatic; antiarthri t ic ; antiviral; 

KW antiinflammatory; antithyroid; antiallergic; antibacterial; cardiant; 

KW dermatological ; neuroprotective; thrombolytic; coagulant; nootropic; 

KW vasotropic; ant ipsoriat ic ; antiangiogenic ; gene therapy; inflammation; 

KW immune disorder; haematopoietic cell disorder; autoimmune disorder; 

KW allergic reaction; graft versus host disease; organ rejection; 

KW haemostatic; thrombolytic; cardiovascular disorder; infection; 

KW neurological disease; drug screening. 

XX 

OS Homo sapiens . 
XX 

PN WO200055350-A1 . 
XX 

PD 21-SEP-2000. 
XX 

PF 08-MAR-2000; 2000WO-US05882 . 
XX 

PR 12-MAR-1999; 99US - 0 124 2 7 0 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Ruben SM; 
XX 

DR WPI; 2000-587533/55. 

DR N-PSDB; AAC78128. 
XX 

PT Novel isolated nucleic acids comprising sequences encoding peptides 

PT useful for treating or diagnosing e.g. cancer - 

XX 

PS Claim 11; Page 2024-2026; 2352pp; English. 
XX 

CC AAC77607 to AAC78448 encode the human cancer associated proteins given 

CC in AAB43398 to AAB44239. The proteins can have activities based on the 

CC tissues and cells the genes are expressed in. Example of activities 

CC include: cytostatic; proliferative; vulnerary; immunomodulator; 

CC antidiabetic; antiasthmatic; antirheumatic; ant iarthritic ; 

CC antiinflammatory; antithyroid; antiallergic; antibacterial; antiviral; 

CC dermatological; neuroprotective; cardiant; thrombolytic; coagulant; 

CC nootropic; vasotropic; ant ipsoriat ic and antiangiogenic. The 

CC polynucleotides and polypeptides can be used for preventing, treating or 

CC ameliorating medical conditions and diagnosing pathological conditions. 

CC Polynucleotides, polypeptides, antibodies, agonists and antagonists from 

CC the present invention may be used to treat immune disorders by activating 

CC or inhibiting the proliferation, differentiation or mobilisation of 

CC immune cells, to treat disorders of haematopoietic cells, autoimmune 

CC disorders, allergic reactions, graft versus host disease and organ 

CC rejection, modulate haemostatic or thrombolytic activity, modulate 







• • 




/-I /-I 

LL. 


inflammation, cancers, cardiovascular disorders, neurological disease 


and 


HC 


bacterial or viral infections. The peptides, nucleotides, antibodies, 






agonists 


and antagonists may be also be used in drug screens. AAC78449 to 


LC 


AAC78457 


and AAB44240 represent sequences used in the exemplification 


of 


CC 


the present invention. 




XX 








SQ 


Sequence 


602 AA; 




Query Match 


99.8%; Score 2816; DB 21; Length 602; 




Best Local 


Similarity 99.8%; Pred. No. 2e-253; 




Matches 513; Conservative 0; Mismatches 1; Indels 0; Gaps 


0 


Qy 


1 


MSWPRRLLLRYLFPALLLHGLGEGSALLHPDSRSHPRSLEKSAWRAFKESQCHHMLKHLH 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 i I 1 \ I I I I I I I i I i i i ( \ i i i i i i i i i i i i i i i t i i i 


60 


Db 


89 


1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 

MSWPRRLLLRYLFPALLLHGLGEGSALLHPDSRSHPRSLEKSAWRAFKESQCHHMLKHLH 


148 


Qy 


61 


NGARITVQMPPTIEGHWVSTGCEVRSGPEFITRSYRFYHNNTFKAYQFYYGSNRCTNPTY 

1 t 1 1 1 1 i 1 ! 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 l l l l l l t l t i i i i i i i i i 


120 


Db 


149 


M 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 | 1 1 1 | | | | | | M | M | | | | | || | | | | | | | | | | | | | | | 

NGARITVQMPPTIEGHWVSTGCEVRSGPEFITRSYRFYHNNTFKAYQFYYGSNRCTNPTY 


208 


Qy 


121 


TLI IRGKIRLRQASWI IRGGTEADYQLHNVQVICHTEAVAEKLGQQVNRTCPGFLADGGP 

1 1 1 1 1 1 1 I 1 1 1 1 1 ! 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i i i i i i i i i i 


180 


Db 


209 


1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 II 1 1 1 1 1 1 1 

TLI IRGKIRLRQASWI IRGGTEADYQLHNVQVICHTEAVAEKLGQQVNRTCPGFLADGGP 


268 


Qy 


181 


WVQDVAYDLWREENGCECTKAVNFAMHELQLIRVEKQYLHHNLDHLVEELFLGDIHTDAT 

|j|i|-|jr|||j| ilJtJ I iritJ Clii ijiii i riijirii»ii irj>rii#L ^ 


240 


Db 


269 


1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 | | 1 | | | | | | | | || 

WVQDVAYDLWREENGCECTKAVNFAMHELQLIRVEKQYLHHNLDHLVEELFLGDIHTDAT 


328 


Qy 


241 


QRMFYRPSSYQPPLQNAKNHDHACIACRIIYRSDEHHPPILPPKADLTIGLHGEWVSQRC 


300 


Db 


329 


1 M 1 1 i 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIIIIIIIIIIIIIIIIMIIIIIIIIII 

QRMFYRPSSYQPPLQNAKNHDHACIACXI IYRSDEHHPPILPPKADLTIGLHGEWVSQRC 


3 8 8 


Qy 


301 


EVRPEVLFLTRHFIFHDNNNTWEGHYYHYSDPVCKHPTFSIYARGRYSRGVLSSRVMGGT 


360 


Db 


389 


MMIMIIIIIIIIMIIIIIIIIIIIIIMMIIIIIIMIIIIIIIIIIIIMIMI 

EVRPEVLFLTRHFIFHDNNNTWEGHYYHYSDPVCKHPTFSIYARGRYSRGVLSSRVMGGT 


448 


Qy 


361 


EFVFKVNHMKVTPMDAATASLLNVFNGNECGAEGSWQVGIQQDVTHTNGCVALGIKLPHT 


420 


Db 


449 


MIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIMI 

EFVFKVNHMKVTPMDAATASLLNVFNGNECGAEGSWQVGIQQDVTHTNGCVALGIKLPHT 


508 


Qy 


421 


EYEIFKMEQDARGRYLLFNGQRPSDGSSPDRPEKRATSYQMPLVQCASSSPRAEDLAEDS 


480 


Db 


509 


IIIIIIIMIIIMIMIMIIMIIIMIIMIIIIIIIMIIIIIIIIIIIIIIMII 

EYEIFKMEQDARGRYLLFNGQRPSDGSSPDRPEKRATSYQMPLVQCASSSPRAEDLAEDS 


568 


Qy 


481 


GSSLYGRAPGRHTWSLLLAALACLVPLLHWNIRR 514 




Db 


569 


IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

GSSLYGRAPGRHTWSLLLAALACLVPLLHWNIRR 6 02 





ID AAC78128 standard; cDNA; 2162 BP. 
XX 

AC AAC7 812 8; 
XX 

DT 08-FEB-2001 (first entry) 
XX 

DE Human cancer associated gene sequence SEQ ID NO: 522. 
XX 

KW Human; cancer associated gene; cancer antigen; detection; cancer; 

KW diagnosis; cytostatic; proliferative; vulnerary; immunomodulator; 

KW antidiabetic; antiasthmatic; antirheumatic; antiarthri tic ; antiviral; 

KW antiinflammatory; antithyroid; antiallergic; antibacterial; cardiant; 

KW dermatological ; neuroprotective; thrombolytic; coagulant; nootropic; 

KW vasotropic; ant ipsoriat ic ; antiangiogenic ; gene therapy; inflammation; 

KW immune disorder; haematopoietic cell disorder; autoimmune disorder; 

KW allergic reaction; graft versus host disease; organ rejection; 

KW haemostatic; thrombolytic; cardiovascular disorder; infection; 

KW neurological disease; drug screening; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO200055350-A1. 

XX 

PD 21-SEP-2000. 
XX 

PF 08-MAR-2000; 2000WO-US05882 . 
XX 

PR 12-MAR-1999; 99US - 0 124 2 7 0 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Ruben SM; 
XX 

DR WPI; 2000-587533/55. 

DR P-PSDB; AAB43919. 
XX 

PT Novel isolated nucleic acids comprising sequences encoding peptides 

PT useful for treating or diagnosing e.g. cancer - 
XX 
PS 
XX 

CC AAC77607 to AAC78448 encode the human cancer associated proteins given 

CC in AAB43398 to AAB44239. The proteins can have activities based on the 

CC tissues and cells the genes are expressed in. Example of activities 

CC include: cytostatic; proliferative; vulnerary; immunomodulator; 

CC antidiabetic; antiasthmatic; antirheumatic; ant iarthr i t ic ; 

CC antiinflammatory; antithyroid; antiallergic; antibacterial; antiviral; 

CC dermatological; neuroprotective; cardiant; thrombolytic; coagulant; 

CC nootropic; vasotropic; antipsoriat ic and antiangiogenic. The 

CC polynucleotides and polypeptides can be used for preventing, treating or 

CC ameliorating medical conditions and diagnosing pathological conditions. 

CC Polynucleotides, polypeptides, antibodies, agonists and antagonists from 

CC the present invention may be used to treat immune disorders by activating 

CC or inhibiting the proliferation, differentiation or mobilisation of 

CC immune cells, to treat disorders of haematopoietic cells, autoimmune 

CC disorders, allergic reactions, graft versus host disease and organ 

CC rejection, modulate haemostatic or thrombolytic activity, modulate 



Claim 1; Page 1051-1052; 2352pp ; English. 



CC inflammation, cancers, cardiovascular disorders, neurological disease and 

CC bacterial or viral infections. The peptides, nucleotides, antibodies, 

CC agonists and antagonists may be also be used in drug screens. AAC78449 to 

CC AAC78457 and AAB44240 represent sequences used in the exemplification of 

CC the present invention. 

XX 



SQ 


Sequence 


2162 BP; 462 A; 683 C; 594 G ; 412 T; 11 Other; 




Alignment Scores: 

Pred. No. : 0 Length: 2162 
Score: 413.00 Matches: 513 
Percent Similarity: 99.61% Conservative: 0 
Best Local Similarity: 99.61% Mismatches: 1 
Query Match: 80.35% Indels: 2 
DB: 21 Gaps: 0 




US-09 


-720 


-533 


-120 (1-514) x AAC78128 (1-2162) 




Qy 




1 


MetSerTrpProArgArgLeuLeuLeuArgTyrLeuPheProAlaLeuLeuLeuHisGly 2 0 

1 II 1 I 1 1 1 1 1 1 1 I M 1 1 1 1 II 1 M 1 If 1 1 1 II 1 1 1 1 1 1 1 1 1 1 i M i i 


Db 




266 


1 i i 1 M II II ill 1 II 1 I 1 1 II 1 I 1 1 II II 1 II II 1 1 1 M II 

ATGTCCTGGCCGCGCCGCCTCCTGCTCAGATACCTGTTCCCGGCCCTCCTGCTTCACGGG 


325 


Qy 




21 


LeuGlyGluGlySerAlaLeuLeuHisProAspSerArgSerHisProArgSerLeuGlu 

i i i i i i i i i i i i i i i i i f i i i i i t i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i 


40 


Db 




326 


II 1 M M M M M M 1 M 1 M II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 M 1 1 1 1 1 1 1 

CTGGGAGAGGGTTCTGCCCTCCTTCATCCAGACAGCAGGTCTCATCCTAGGTCCTTAGAG 


385 


Qy 




41 


LysSerAlaTrpArgAlaPheLysGluSerGlnCysHisHisMetLeuLysHisLeuHis 

I I I I i I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i j i i 


60 


Db 




386 


M 1 1 M 1 1 II 1 1 1 II II 1 II 1 1 1 1 M M 1 I M 1 1 1 1 II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II M I I 

AAAAGTGCCTGGAGGGCTTTTAAGGAGTCACAGTGCCATCACATGCTCAAACATCTCCAC 


445 


Qy 




61 


AsnGlyAlaArglleThrValGlnMetProProThrlleGluGlvHisTrpValSerThr 


80 


Db 




446 


1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 I I M 1 1 1 1 I 1 1 1 1 1 1 1 II I || I | | | | | | | | M | | | | 
AATGGTGCAAGGATCACAGTGCAGATGCCACCTACAATCGAGGGCCACTGGGTCTCCACA 


505 


Qy 




81 


GlyCysGluValArgSerGlyProGluPhelleThrArgSerTyrArgPheTyrHisAsn 

MIIIIIIIIIIIMIIIMMIMIIIIIIIIMIIMMMIIIIIIIIMIMIMI 


100 


Db 




506 


GGCTGTGAAGTAAGGTCAGGCCCAGAGTTCATCACAAGGTCCTACAGATTCTACCACAAT 


565 


Qy 




101 


AsnThrPheLysAlaTyrGlnPheTyrTyrGlySerAsnArgCysThrAsnProThrTyr 


120 


Db 




566 


1 M 1 1 M 1 1 1 1 1 ! F 1 1 f 1 1 !! 1 1 1 1 r 1 f 1 I M 1 1 1 M 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 M 1 1 1 1 

AACACCTTCAAGGCCTACCAATTTTATTATGGCAGCAACCGGTGCACAAATCCCACTTAT 


625 


Qy 




121 


ThrLeuIlelleArgGlyLysIleArgLeuArgGlnAlaSerTrpIlelleArgGlyGly 

IIIMIIIIIIIIIIMIIIIIIIIIIMIIMIIMIIIIIIMIIIIIIIIMIIIII 


140 


Db 




626 


ACTCTCATCATCCGGGGCAAGATCCGCCTCCGCCAGGCCTCCTGGATCATCCGAGGGGGC 


685 


Qy 




141 


ThrGluAlaAspTyrGlnLeuHisAsnValGlnVallleCysHisThrGluAlaValAla 


160 


Db 




686 


IIMIIIIIMIIIMIMMIMIIIIMIIMIIMIIMIIIIIMMIIIIIIIII 

ACGGAAGCCGACTACCAGCTGCACAACGTCCAGGTGATCTGCCACACAGAGGCGGTGGCC 


745 


Qy 




161 


GluLysLeuGlyGlnGlnValAsnArgThrCysProGlyPheLeuAlaAspGlyGlyPro 


180 


Db 




746 


MIIMIIMIMIIIIIMIIIIIIIIMMMIIIMIMIIIIIMMIIMIMM 

GAGAAGCTCGGCCAGCAGGTGAACCGCACATGCCCGGGCTTCCTCGCAGACGGGGGTCCC 


805 


Qy 




181 


TrpValGlnAspValAlaTyrAspLeuTrpArgGluGluAsnGlyCysGluCysThrLys 

MIIIIIIMIIIIIIIIIIMMMMMIMIMMIIIMIIIIIIMIIIIIMM 


200 


Db 




806 


TGGGTGCAGGACGTGGCCTATGACCTCTGGCGAGAGGAGAACGGCTGTGAGTGCACCAAG 


865 




Qy 


201 


AlaValAsnPheAlaMetHisGluLeuGlnLeuIleArgValGluLysGlnTyrLeuHis 


220 


Db 


866 


! M ! M 1 I M ! M 1 1 i 1 1 ; i : 1 ! : i i ! M ; ! ■ M M i M ! ! ! ! 1 ! 1 M M ! h ! ! , ! 1 ! 1 

GCCGTGAACTTTGCCATGCATGAACTTCAGCTCATCCGGGTGGAGAAGCAGTACCTTCAC 


925 


Qy 


2 21 


HisAsnLeuAspHisLeuValGluGluLeuPheLeuGlyAspIleHisThrAspAlaThr 

1 II 1 1 1 1 1 ! 1 1 ! ! 1 1 1 1 1 i 1 i ! 1 i 1 ! ! 1 ! 1 1 1 I ! ! i i M i 1 i ! ! 1 M ! 1 ! M ] 1 ! i 1 I |[ 


240 


Db 


926 


CACAACCTCGACCACCTGGTCGAGGAGCTCTTCCTTGGTGACATTCACACTGATGCCACC 


985 


Qy 


241 


GlnArgMetPheTyrArgProSerSerTyrGlnProProLeuGlnAsnAlaLysAsnHis 


260 


Db 


986 


1 1 1 1 1 1 ! 1 1 1 1 M I 1 1 I I 1 1 1 1 i i i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 

CAGAGGATGTTCTACCGGCCCTCCAGTTACCAGCCCCCTCTGCAGAATGCCAAGAACCAC 


1045 


Qy 


261 


AspHisAlaCysIleAlaCysArg-IlelleTyrArgSerAspGluHisHisProProIl 


280 


Db 


1046 


1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 ( I I I | 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

GACCATGCCTGCATCGCCTGTSG-GATCATCTATCGGTCAGACGAGCACCACCCTCCCAT 


1104 


Qy 


280 


eLeuProProLysAlaAspLeuThrlleGlyLeuHisGlyGluTrpValSerGlnArgCy 

1 1 1 1 J 1 I j 1 I 1 | | 1 | 1 [ 1 J 1 I 1 1 I 1 1 j | I | 1 I 1 | | t , * t tt ,t V ■ 

M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I I I I M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 


300 


Db 


1105 


CCTGCCCCCAAAGGCAGACCTGACCATCGGCCTGCACGGGGAGTGGGTGAGCCAGCGCTG 


1164 


Qy 


300 


sGluValArgProGluValLeuPheLeuThrArgHisPhellePheHisAspAsnAsnAs 


320 


Db 


1165 


1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | || 

TGAGGTGCGCCCCGAAGTCCTCTTCCTCACCCGCCACTTCATCTTCCATGACAACAACAA 


1224 


Qy 


320 


nThrTrpGluGlyHisTyrTyrHisTyrSerAspProValCysLysHisProThrPheSe 


340 


Db 


1225 


M 1 1 M II 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 M | | 

CACCTGGGAGGGCCACTACTACCAGTACTCAGACCCGGTGTGCAAGCACCCCACCTTCTC 


1284 


Qy 


340 


rlleTyrAlaArgGlyArgTyrSerArgGlyValLeuSerSerArgValMetGlyGlyTh 

1 j i i t i \ i i i f i i i i i i 1 i i i i ■ ■ ■ i j ^ 


360 


Db 


1285 


MMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIM 

CATCTACGCCCGGGGCCGCTACAGCCGCGGCGTCCTCTCGTCCAGGGTCATGGGAGGCAC 


1344 


Qy 


360 


rGluPheValPheLysValAsnHisMetLysValThrProMetAspAlaAlaThrAlaSe 

1 i 1 j i i t i i i i i i i i i i \ t i i i i i i t i j , 


380 


Db 


1345 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 MINI 1 1 1 II II 1 1 II 1 1 I I | I | | | | | | | | 

CGAGTTCGTGTTCAAAGTGAATCACATGAAGGTCACCCCCATGGATGCGGCCACAGCCTC 


1404 


Qy 


380 


rLeuLeuAsnValPheAsnGlyAsnGluCysGlyAlaGluGlySerTrpGlnValGlyll 

1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i i i i i i i i , 


400 


Db 


1405 


1 1 N 1 N 1 1 1 1 1 1 1 M 1 1 1 1 II M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 

ACTGCTCAACGTCTTCAACGGGAATGAGTGCGGGGCCGAGGGCTCCTGGCAGGTGGGCAT 


1464 


Qy 


400 


eGlnGlnAspValThrHisThrAsnGlyCysValAlaLeuGlylleLysLeuProHisTh 

i i i i i i i i i i i i i i i i i i i i i 


420 


Db 


1465 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | 1 1 | | | | | | | | MUM 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCAGCAGGATGTGACCCACACCAATGGCTGCGTGGCCCTGGGCATCAAACTACCTGACAC 


1524 


Qy 


420 


TGluTyrGluIlePheLysMetGluGlnAspAlaArgGlyArgTyrLeuLeuPheAsnGl 

1 1 1 1 ] j t 1 1 1 j 1 i i i i i i j i j i i i i i i i i * . , 


440 


Db 


1525 


1 1 1 1 MINI 1 1 1 1 1 1 1 1 1 1 I I 1 I 1 1 1 | | | | | | | | | 1 1 1 | | | | | | | || | 1 1 1 1 1 | || || | 

GGAGTACGAGATCTTCAAAATGGAACAGGATGCCCGGGGGCGCTATCTGCTGTTGAACGG 


1584 


Qy 


440 


yGlnArgProSerAspGlySerSerProAspArgProGluLysArgAlaThrSerTyrGl 


4 6 0 


Db 


1585 


INMIIMMIIIIIMMMMIMIIIIIIIMIIMIIIMIIIIIIIIIIIMM 

TCAGAGGCCCAGGGACGGGTCCAGCCCAGACAGGCCAGAGAAGAGAGCCACGTCCTACCA 


1644 


Qy 


460 


HMetProLeuValGlnCysAlaSerSerSerProArgAlaGluAspLeuAlaGluAspSe 

MMIIIIIIIIIIMIIIIMIIIIIIIIIIMIIIIIIIIMIIIMIIIMIIMII 


480 


Db 


1645 


GATGCCCTTGGTCCAGTGTGCCTCCTCTTCGCCGAGGGCAGAGGACCTYGCAGAAGACAG 


1704 




Qy 480 rGlySerSerLeuTyrGlyArgAlaProGlyArgHisThrTrpSerLeuLeuLeuAlaAl 5 00 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 

Db 17 0 5 TGGAAGCAGCCTGTATGGCCGGGCCCCTGGGAGGCACACCTGGTCCCTGCTGCTGGCTGC 17 64 

Qy 500 aLeuAlaCysLeuValProLeuLeuHisTrpAsnlleArgArg 514 

1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 M 1 1 1 

Db 17 65 ACTTGCCTGYCTTGTCCCTCTGCTGCATTGGAACATCCGCAGA 18 07 



ID 


AAC78128 standard; cDNA; 2162 BP. 




XX 






AC 


AAC78128; 




XX 






DT 


08-FEB-2001 (first entry) 




XX 






DE 


Human cancer associated gene sequence SEQ ID 


NO:522. 


XX 






PN 


WO200055350-A1 . 




XX 






PS 


Claim 1; Page 1051-1052; 2352pp; English. 




XX 






SQ 


Sequence 2162 BP; 462 A; 683 C; 594 G; 412 T; 


11 other 



Query Match 85.3%; Score 1583; DB 21; Length 2162; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1833; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 



Qy 


19 


GCGGAGGGCAGAGCGCGCGCCCAGTTGCCCGGGCACCAAATCGGAGCGCGGCGTGCGGGA 


78 


Db 


185 


M 1 M 1 [ 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 M 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1] 1 ! M [ 1 1 1 II 1 1 j 1 

GCGGAGGGCAGAGCGCGCGCCCAGTTGCCCGGGCACCAAATCGGAGCGCGGCGTGCGGGA 


244 


Qy 


79 


GGGCCCAGAGCAGGACTGGAAATGTCCTGGCCGCGCCGCCTCCTGCTCAGATACCTGTTC 


138 


Db 


245 


IIIIIMIIIMMIIIIMIIIIIMMIIIMIIIIIIIIIIMIIMMMIIMII 

GGGCCCAGAGCAGGACTGGAAATGTCCTGGCCGCGCCGCCTCCTGCTCAGATACCTGTTC 


304 


Qy 


139 


CCGGCCCTCCTGCTTCACGGGCTGGGAGAGGGTTCTGCCCTCCTTCATCCAGACAGCAGG 


198 


Db 


305 


MMMMMIIIIIIIIIIIMMMIIIIIIIMIIMIMMIMIIIIIIMIIM 

CCGGCCCTCCTGCTTCACGGGCTGGGAGAGGGTTCTGCCCTCCTTCATCCAGACAGCAGG 


364 


Qy 


199 


TCTCATCCTAGGTCCTTAGAGAAAAGTGCCTGGAGGGCTTTTAAGGAGTCACAGTGCCAT 


258 


Db 


365 


MMMMIIIIIIIMIIIIIIIMIIIIIIIMIIIMIMIIMIIIIIIIIIIMI 

TCTCATCCTAGGTCCTTAGAGAAAAGTGCCTGGAGGGCTTTTAAGGAGTCACAGTGCCAT 


424 


Qy 


259 


CACATGCTCAAACATCTCCACAATGGTGCAAGGATCACAGTGCAGATGCCACCTACAATC 


318 


Db 


425 


MIIIIMIIIMIIIIIMIIIIMIIIIIIIIIIIMIIIIIIIMIIIIIIIMIII 

CACATGCTCAAACATCTCCACAATGGTGCAAGGATCACAGTGCAGATGCCACCTACAATC 


484 


Qy 


319 


GAGGGCCACTGGGTCTCCACAGGCTGTGAAGTAAGGTCAGGCCCAGAGTTCATCACAAGG 


378 


Db 


485 


M 1 1 1 1 M M 1 1 1 1 1 M 1 II M 1 1 1 1 1! 1 f M 1 1 ! 1 M 1 M 1 1 1 If 1 1 I M 1 ! 1 1 1 1 1 1 ! 

GAGGGCCACTGGGTCTCCACAGGCTGTGAAGTAAGGTCAGGCCCAGAGTTCATCACAAGG 


544 


Qy 


379 


TCCTACAGATTCTACCACAATAACACCTTCAAGGCCTACCAATTTTATTATGGCAGCAAC 


438 


Db 


545 


MIMMIIIIIMMIMMIIIIIIMIMIIIMIMIIMIIIIIIIMMIIIM 

TCCTACAGATTCTACCACAATAACACCTTCAAGGCCTACCAATTTTATTATGGCAGCAAC 


604 




Qy 


439 


CGGTGCACAAATCCCACTTATACTCTCATCATCCGGGGCAAGATCCGCCTCCGCCAGGCC 


498 


Db 


605 


I ! M ! 1 1 M ! i 1 1 M ! M 1 ! ; i M M 1 ! M ! ! M M | I ! i ! ! ! 1 1 1 1 1 ! M | ; 1 1 M ! II 

CGGTGCACAAATCCCACTTATACTCTCATCATCCGGGGCAAGATCCGCCTCCGCCAGGCC 


664 


Qy 


499 


TCCTGGATCATCCGAGGGGGCACGGAAGCCGACTACCAGCTGCACAACGTCCAGGTGATC 


558 


Db 


665 


1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 i 1 1 1 ! i i 1 i 1 i ! ! 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 

TCCTGGATCATCCGAGGGGGCACGGAAGCCGACTACCAGCTGCACAACGTCCAGGTGATC 


724 


Qy 


559 


TGCCACACAGAGGCGGTGGCCGAGAAGCTGGGCCAGCAGGTGAACCGCACATGCCCGGGC 


618 


Db 


725 


M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 M 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 
TGCCACACAGAGGCGGTGGCCGAGAAGGTCGGCCAGCAGGTGAACCGCACATGCCCGGGC 


784 


Qy 


619 


TTCCTCGCAGACGGGGGTCCCTGGGTGCAGGACGTGGCCTATGACCTCTGGCGAGAGGAG 


678 


Db 


785 


IMIIIIIIIIIMMIMIIIIMIIIIIIIMMIMIIIMIIIIIMIMIIIIM 

TTCCTCGCAGACGGGGGTCCCTGGGTGCAGGACGTGGCCTATGACCTGTGGCGAGAGGAG 


844 


Qy 


679 


AACGGCTGTGAGTGCACCAAGGCCGTGAACTTTGCCATGCATGAACTTCAGCTCATCCGG 


738 


Db 


845 


1 1 II 1 1 1 1 M 1 M i II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M M | | | | | | | || | M | | | M | | | | | 
AACGGCTGTGAGTGCACCAAGGCCGTGAACTTTGCCATGCATGAACTTCAGCTCATCCGG 


904 


Qy 


739 


GTGGAGAAGCAGTACCTTCACCACAACCTCGACCACCTGGTCGAGGAGCTCTTCCTTGGT 


798 


Db 


905 


MMIIIIIMIIIIIIIMIIIIMIIMMIIIIIIIIIIIIIMIIIIIIIIIIIII 

GTGGAGAAGCAGTACCTTCACCACAACCTCGACCACCTGGTCGAGGAGCTCTTCCTTGGT 


964 


Qy 


799 


GACATTCACACTGATGCCACCCAGAGGATGTTCTACCGGCCCTCCAGTTACCAGCCCCCT 


858 


Db 


965 


IMIIIIIIIIMIIIIIIIIIIIIIIIMMIIMIIMMIIIMIIIIMIIIMM 

GACATTCACACTGATGCCACCCAGAGGATGTTCTACCGGCCCTCCAGTTACCAGCCCCCT 


1024 


Qy 


859 


CTGCAGAATGCCAAGAACCACGACCATGCCTGCATCGCCTGTCGGATCATCTATCGGTCA 


918 


Db 


1025 


MIIIIMIIIIIMMIIIIIIIMIIIIIIIIMIIIIII IIMIMIIIMIMM 

CTGCAGAATGCCAAGAACCACGACCATGCCTGCATCGCCTGTSGGATCATCTATCGGTCA 


1084 


Qy 


919 


GACGAGCACCACCCTCCCATCCTGCCCCCAAAGGCAGACCTGACCATCGGCCTGCACGGG 


978 


Db 


1085 


MIMIIIIIIIIIIMMIIIMIIIIIIIMMIIIIIIIMIIIIIIIIIMIMM 

GACGAGCACCACCCTCCCATCCTGCCCCCAAAGGCAGACCTGACCATCGGCCTGCACGGG 


1144 


Qy 


979 


GAGTGGGTGAGCCAGCGCTGTGAGGTGCGCCCCGAAGTCCTCTTCCTCACCCGCCACTTC 


1038 


Db 


1145 


MIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMII 

GAGTGGGTGAGCCAGCGCTGTGAGGTGCGCCCCGAAGTCCTCTTCCTCACCCGCCACTTC 


1204 


Qy 


1039 


ATCTTCCATGACAACAACAACACCTGGGAGGGCCACTACTACCACTACTCAGACCCGGTG 


1098 


Db 


1205 


IMIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIMIIIIIMIIM 

ATCTTCCATGACAACAACAACACCTGGGAGGGCCACTACTACCACTACTCAGACCCGGTG 


1264 


Qy 


1099 


TGCAAGCACCCCACCTTCTCCATCTACGCCCGGGGCCGCTACAGCCGCGGCGTCCTCTCG 


1158 


Db 


1265 


IIIIMIMIIIMIIIIIIIIIIIIMIIIMIIIIMIMIIIIIIIMIIIIIMM 

TGCAAGCACCCCACCTTCTCCATCTACGCCCGGGGCCGCTACAGCCGCGGCGTCCTCTCG 


1324 


Qy 


1159 


TCCAGGGTCATGGGAGGCACCGAGTTCGTGTTCAAAGTGAATCACATGAAGGTCACCCCC 


1 0 1 P 
-L £ ± o 


Db 


1325 


Mlllllllllllllllliillllllllllllllllllllllllllllllllllllllll 

TCCAGGGTCATGGGAGGCACCGAGTTCGTGTTCAAAGTGAATCACATGAAGGTCACCCCC 


1384 


Qy 


1219 


ATGGATGCGGCCACAGCCTCACTGCTCAACGTCTTCAACGGGAATGAGTGCGGGGCCGAG 


1278 


Db 


1385 


IMIIIIIIIIIIIIIIIIIIIMIIIMIMIIMIIIIIIIIIIIIIIIIIMMMI 

ATGGATGCGGCCACAGCCTCACTGCTCAACGTCTTCAACGGGAATGAGTGCGGGGCCGAG 


1444 



Qy 12 7 9 GGCTCCTGGCAGGTGGGCATCCAGCAGGATGTGACCCACACCAATGGCTGCGTGGCCCTG 13 3 8 

Db 14 4 5 GGCTCCTGGCAGGTGGGCATCCAGCAGGATGTGACCCACACCAATGGCTGCGTGGCCCTG 15 04 

Qy 13 3 9 GGCATCAAACTACCTCACACGGAGTACGAGATCTTCAAAATGGAACAGGATGCCCGGGGG 13 98 

IIIIIIMIIIMIIMIIIIIIIIMMIIIMIIIIMIIIIIIIIIIIIIIIIMII 

Db 15 05 GGCATCAAACTACCTCACACGGAGTACGAGATCTTCAAAATGGAACAGGATGCCCGGGGG 15 64 

Qy 13 99 CGCTATCTGCTGTTCAACGGTCAGAGGCCCAGCGACGGGTCCAGCCCAGACAGGCCAGAG 14 5 8 

IlillllMIIIIMIIIIIIIIIMIIIMIMIIMIIIIIIIIMIIilllMIMI 

Db 1565 CGCTATCTGCTGTTCAACGGTCAGAGGCCCAGCGACGGGTCCAGCCCAGACAGGCCAGAG 162 4 

Qy 14 5 9 AAGAGAGCCACGTCCTACCAGATGCCCTTGGTCCAGTGTGCCTCCTCTTCGCCGAGGGCA 1518 

IMIIIMIIIMIIIIIIIIIIIIIMMIIIIIIIIIIIIMMIMIMIIMIMI 

Db 162 5 AAGAGAGCCACGTCCTACCAGATGCCCTTGGTCCAGTGTGCCTCCTCTTCGCCGAGGGCA 16 84 

Qy 1519 GAGGACCTCGCAGAAGACAGTGGAAGCAGCCTGTATGGCCGGGCCCCTGGGAGGCACACC 157 8 

MINIM 

Db 16 8 5 GAGGACCTYGCAGAAGACAGTGGAAGCAGCCTGTATGGCCGGGCCCCTGGGAGGCACACC 174 4 

Qy 15 79 TGGTCCCTGCTGCTGGCTGCACTTGCCTGCCTTGTCCCTCTGCTGCATTGGAACATCCGC 163 8 

M M I I M II M I I I I I I I II M II I I I I 

Db 174 5 TGGTCCCTGCTGCTGGCTGCACTTGCCTGYCTTGTCCCTCTGCTGCATTGGAACATCCGC 18 04 

Qy 16 3 9 AGATAGAAGTTTTAGAAAGTTCTATTTTTCCAAACCAGGATTCCTTACTATTGACAGATT 16 98 

1 1 1 1 I 1 1 1 i I I I I I I 1 1 I I I 1 1 I M [ I I 1 1 1 1 1 1 1 I I 1 1 1 ( i I I 1 1 1 1 1 1 1 1 1 1 1 1 1 I f I 

Db 18 0 5 AGATAGAAGTTTTAGAAAGTTCTATTTTTCCAAACCAGGATTCCTTACTATTGACAGATT 18 64 

Qy 16 9 9 TTCTTTACCAAAAGAAAAGACATTTATTCTTTTGATGCACTTGAATGCCAGAGAACTGTC 17 5 8 

I IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIMIIIMMIIIMIIIIII 

Db 18 65 TKCTTTACCAAAAGAAAAGACATTTATTCTTTTGATGCACTTGAATGCCAGAGAACTGTC 1924 

Qy 175 9 CTTCTTTTTCTCCTCTCCCTCCCTCCCAGCCCCTGAGTCATGAACAGCAAGGAGTGTTTG 1818 

M I M M M I I II I I I I M 1 1 I I I I M I II II M I I I I M I I M I II I I I II II I M 1 1 I 

Db 192 5 CTTCTTTTTCTCCTCTCCCTCCCTCCCAGCCCCTGAGTCATGAACAGCAAGGAGTGTTTG 19 84 

Qy 1819 AAGTTTCTGCTTTGAACTCCGTCCAGCCTGATCCCTGG 18 56 

MIIIIIMIIIIIIIIIIIIIIIIIIIIIMMIIII 

Db 198 5 AAGTTTCTGCTTTGAACTCCGTCCAGCCTGATCCCTGG 2 02 2 

BM715696 ' ' ■ 1 D" 
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/organi sm= "Homo sapiens" 
/db_xref ="taxon: 9606" 
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/clone_lib="UI-E-EJO " 

/tissue_type= M f etal eyes, lens, eye anterior segment, 
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stranded cDNA was ligated to an EcoR I adaptor, digested 
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vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT) 18 tail. The 
sequence tags for this library are: fetal eyes, AGAATCAAGA 
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created for the program, Gene Discovery in the Visual 
System, supported by National Eye Institute (NEI). M 
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Qy 


10 


ArgTyrLeuPheProAlaLeuLeuLeuHisGlyLeuGlyGluGlySerAlaLeuLeuHis 

MIIIIMIIIIIIIIIIMIIIMIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 


29 


Db 


2 


AGATACCTGTTCCCGGCCCTCCTGCTTCACGGGCTGGGAGAGGGTTCTGCCCTCCTTCAT 


61 


Qy 


30 


ProAspSerArgSerHisProArgSerLeuGluLysSerAlaTrpArgAlaPheLysGlu 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIMIMIIIIIMII 


49 


Db 


62 


CCAGACAGCAGGTCTCATCCTAGGTCCTTAGAGAAAAGTGCCTGGAGGGCTTTTAAGGAG 


121 


Qy 


50 


SerGlnCysHisHisMetLeuLysHisLeuHisAsnGlyAlaArglleThrValGlnMet 

IIIMIIIIMIIIIIMIIIIIMIIIIIIMIIIIMIIIMIIIIIIIMIMIIII 


69 


Db 


122 


TCACAGTGCCATCACATGCTCAAACATCTCCACAATGGTGCAAGGATCACAGTGCAGATG 


181 


Qy 


70 


ProProThrlleGluGlyHisTrpValSerThrGlyCysGluValArgSerGlyProGlu 

IIIIMIIIIIMIMIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIM 


89 


Db 


182 


CCACCTACAATCGAGGGCCACTGGGTCTCCACAGGCTGTGAAGTAAGGTCAGGCCCAGAG 


241 


Qy 


90 


PhelleThrArgSerTyrArgPheTyrHisAsnAsnThrPheLysAlaTyrGlnPheTyr 


109 


Db 


242 


IIIIIMMIIIIIMIMIIIIIIIIIIIIMIMIIIIIIMMIMIIIIIMIMI 

TTCATCACAAGGTCCTACAGATTCTACCACAATAACACCTTCAAGGCCTACCAATTTTAT 


301 


Qy 


110 


TyrGlySerAsnArgCysThrAsnProThrTyrThrLeuIlelleArgGlyLysIleArg 

IIMMIIIMMIIIIIIIMIIIIIIIIIIIIIMIMIIIIIIIIIIIMMIMM 


129 


Db 


302 


TATGGCAGCAACCGGTGCACAAATCCCACTTATACTCTCATCATCCGGGGCAAGATCCGC 


361 


Qy 


130 


LeuArgGlnAlaSerTrpIlelleArgGlyGlyThrGluAlaAspTyrGlnLeuHisAsn 

IIIIIIIIIMIIIIIIIIIIIMMMIIIIIMIIMIIIIIIIIIIMIMIIIIII 


149 


Db 


362 


CTCCGCCAGGCCTCCTGGATCATCCGAGGGGGCACGGAAGCCGACTACCAGCTGCACAAC 


421 


Qy 


150 


ValGlnVallleCysHisThrGluAlaValAlaGluLysLeuGlyGlnGlnValAsnArg 


169 


Db 


422 


IMIIIIIIMIIIIIIIIIIMIMMIIIIIIIIMIIIMIIIIIIIIIIIIIMM 

GTCCAGGTGATCTGCCACACAGAGGCGGTGGCCGAGAAGCTCGGCCAGCAGGTGAACCGC 


481 


Qy 


170 


ThrCysProGlyPheLeuAlaAspGlyGlyProTrpValGlnAspValAlaTyrAspLeu 

IIIIIMIIIIMMIIMIIIIIIIIMMMIIIIIMIMMIIIIIMIIIIMM 


189 


Db 


482 


ACATGCCCGGGCTTCCTCGCAGACGGGGGTCCCTGGGTGCAGGACGTGGCCTATGACCTC 


541 


Qy 


190 


TrpArgGluGluAsnGlyCysGluCysThrLysAlaValAsnPheAlaMetHisGluLeu 


209 


Db 


542 


IIIIIIIIIIIIIIMIIIIIIIIIIMIIMIIIIIIIIMIIIIIIIIIIIIIIIIII 

TGGCGAGAGGAGAACGGCTGTGAGTGCACCAAGGCCGTGAACTTTGCCATGCATGAACTT 


601 


Qy 


210 


GlnLeuIleArgValGluLysGlnTyrLeuHisHisAsnLeuAspHisLeuValGluGlu 


229 


Db 


602 


IIIMIIIIIIIIIIIIIIIIIMMIIIIIIIIIMIIIIIMIIIMIIIIMIIMI 

CAGCTCATCCGGGTGGAGAAGCAGTACCTTCACCACAACCTCGACCACCTGGTCGAGGAG 


661 


Qy 


2 3 0 


LeuPheLeuGlyAspIleHisThrAspAlaThrGlnArgMetPheTyrArgProSerSer 

IIIIIMIIIIIIIIIMIIMIIIIIMIIIMIIMMIIIIIIIIMIIIIMIIM 


249 


Db 


662 


CTCTTCCTTGGTGACATTCACACTGATGCCACCCAGAGGATGTTCTATCGGCCCTCCAGT 


721 


Qy 


250 


TyrGlnProProLeuGlnAsnAlaLys 258 

IIIIIIIIIIIMIIIIIMIIIIIII 




Db 


722 


TACCAGCCCCCTCTGCAGAATGCCAAG 74 8 
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REFERENCE 1 (bases 1 to 798) 
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discovery 
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ORIGIN 

Query Match 35.4%; Score 657; DB 14; Length 798 ; 
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Matches 707; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 12 6 CAGATACCTGTTCCCGGCCCTCCTGCTTCACGGGCTGGGAGAGGGTTCTGCCCTCCTTCA 185 

MMIMIMMMIIMIMIIIIIIIIIIIMMIIIIIIIIIIIIMMMIIIMI 

Db 1 CAGATACCTGTTCCCGGCCCTCCTGCTTCACGGGCTGGGAGAGGGTTCTGCCCTCCTTCA 6 0 

QY 186 TCCAGACAGCAGGTCTCATCCTAGGTCCTTAGAGAAAAGTGCCTGGAGGGCTTTTAAGGA 24 5 

MIMIIIIIIIIIIIIIIIIIMIMIIIMIIIIIMIMIIMIIIIIIIIIIIIII 

Db 61 TCCAGACAGCAGGTCTCATCCTAGGTCCTTAGAGAAAAGTGCCTGGAGGGCTTTTAAGGA 12 0 

Qy 24 6 GTCACAGTGCCATCACATGCTCAAACATCTCCACAATGGTGCAAGGATCACAGTGCAGAT 3 05 

MMIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIMIIIMIIIIIIIIIIIIII 

Db 121 GTCACAGTGCCATCACATGCTCAAACATCTCCACAATGGTGCAAGGATCACAGTGCAGAT 18 0 

Qy 3 06 GCCACCTACAATCGAGGGCCACTGGGTCTCCACAGGCTGTGAAGTAAGGTCAGGCCCAGA 3 65 

MIIMMIMMMMMIIMIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIII 

Db 181 GCCACCTACAATCGAGGGCCACTGGGTCTCCACAGGCTGTGAAGTAAGGTCAGGCCCAGA 24 0 

Qy 3 6 6 GTTCATCACAAGGTCCTACAGATTCTACCACAATAACACCTTCAAGGCCTACCAATTTTA 4^5 

IMMMMIIIIIIIIIIIIIIIIIIMMIIMIIIIIIIIIIIIIIIIIIIIIIIll 

Db 241 GTTCATCACAAGGTCCTACAGATTCTACCACAATAACACCTTCAAGGCCTACCAATTTTA 3 00 

Qy 4 2 6 TTATGGCAGCAACCGGTGCACAAATCCCACTTATACTCTCATCATCCGGGGCAAGATCCG 4 85 

MMIIIIIIMIIIIIIIIIIIIIIIIIMMIIIIIIMIIIIIIIilllllllllli 

Db 3 01 TTATGGCAGCAACCGGTGCACAAATCCCACTTATACTCTCATCATCCGGGGCAAGATCCG 3 60 

Qy 4 86 CCTCCGCCAGGCCTCCTGGATCATCCGAGGGGGCACGGAAGCCGACTACCAGCTGCACAA 54 5 

IlilMilMIIIIIIIIIMIIIIIIIIIIIIIIllliiiiiMiiiiiiiiiiiiiii 

Db 361 CCTCCGCCAGGCCTCCTGGATCATCCGAGGGGGCACGGAAGCCGACTACCAGCTGCACAA 42 0 

Qy 546 CGTCCAGGTGATCTGCCACACAGAGGCGGTGGCCGAGAAGCTGGGCCAGCAGGTGAACCG 6 05 

IMIIIMIMIIIIIIillMMIMIIIIIIIMllMII I I I I I I I I I I I I I I I I I 

Db 421 CGTCCAGGTGATCTGCCACACAGAGGCGGTGGCCGAGAAGCTCGGCCAGCAGGTGAACCG 4 80 



Qy 606 CACATGCCCGGGCTTCCTCGCAGACGGGGGTCCCTGGGTGCAGGACGTGGCCTATGACCT 665 

Ob 481 CACATGCCCGGGCTTCCTCGCAGACGGGGGTCCCTGGGTGCAGGACGTGGCCTATGACCT 54 0 

Qy 666 GTGGCGAGAGGAGAACGGCTGTGAGTGCACCAAGGCCGTGAACTTTGCCATGCATGAACT 72 5 

IIIIIIIIMIIIMIIIIIIIIIIIIIMMIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db 541 CTGGCGAGAGGAGAACGGCTGTGAGTGCACCAAGGCCGTGAACTTTGCCATGCATGAACT 6 00 

Qy 72 6 TCAGCTCATCCGGGTGGAGAAGCAGTACCTTCACCACAACCTCGACCACCTGGTCGAGGA 7 85 

Db 601 TCAGCTCATCCGGGTGGAGAAGCAGTACCTTCACCACAACCTCGAGCACCTGGTCGAGGA 6 60 

Qy 7 86 GCTCTTCCTTGGTGACATTCACACTGATGCCACCCAGAGGATGTTCTA 83 3 

IMIIIIIIIIMIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

Db 661 GCTCTTCCTTGGTGACATTCACACTGATGCCACCCAGAGGATGTTCTA 708 



